On Wasserstein geometry of the space of Gaussian measures 



Asuka TAKATSU 
Abstract 

The space of Gaussian measures on a Euclidean space is geodesically convex in the $L^2$-Wasserstein
space. This space is a finite dimensional manifold since Gaussian measures are parameterized by
means and covariance matrices. By restricting to the space of Gaussian measures inside the
$L^2$-Wasserstein space, we manage to provide detailed descriptions of the $L^2$-Wasserstein geometry
from a Riemannian geometric viewpoint. We first construct a Riemannian metric which induces
the $L^2$-Wasserstein distance. Then we obtain a formula for the sectional curvatures of the space
of Gaussian measures, which is written out in terms of the eigenvalues of the covariance matrix.



1. Introduction 

In this paper, we give a formula for sectional curvatures of the space of Gaussian measures
on $\mathbb{R}^d$ with the $L^2$-Wasserstein metric. Let $N(m, V)$ be the Gaussian measure with mean $m$
and covariance matrix $V$. Namely, $m$ is a vector in $\mathbb{R}^d$, $V$ is a symmetric positive definite
matrix of size $d$, and the Radon-Nikodym derivative of $N(m, V)$ is given by

$$\frac{dN(m,V)}{dx} = \frac{1}{\sqrt{\det(2\pi V)}} \exp\left[-\frac{1}{2}\langle x - m, V^{-1}(x - m)\rangle\right].$$

We denote by $\mathcal{N}^d$ the space of Gaussian measures on $\mathbb{R}^d$. Since Gaussian measures depend only
on the mean $m$ and the covariance matrix $V$, the space $\mathcal{N}^d$ is identified with
$\mathbb{R}^d \times \mathrm{Sym}^+(d, \mathbb{R})$, where $\mathrm{Sym}^+(d, \mathbb{R})$ is the space of symmetric positive definite
matrices of size $d$.

The $L^2$-Wasserstein space is a space of probability measures equipped with a certain
distance. Let $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ be the set of absolutely continuous probability measures with finite
second moments on $\mathbb{R}^d$. Then $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ is a geodesic space and all geodesics are given by
push-forward measures. In view of these facts, Otto [13] regarded $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ as an infinite
dimensional formal Riemannian manifold and analyzed the porous medium equation as a gradient
flow on $\mathcal{P}_2^{ac}(\mathbb{R}^d)$.

A foundation for this framework was carefully laid out by Carrillo-McCann-Villani [3]. They
introduced a new notion, the Riemannian length space. In short, this is a length space which
has an exponential map defined on some tangent vector space with a metric. They proved that
$\mathcal{P}_2^{ac}(\mathbb{R}^d)$ is a Riemannian length space and that its metric induces the $L^2$-Wasserstein
distance. We call this metric the $L^2$-Wasserstein metric.

McCann showed in [9] that varying the mean is equivalent to a Euclidean translation and that
$\mathcal{N}^d$ is a geodesically convex subspace of $\mathcal{P}_2^{ac}(\mathbb{R}^d)$. Hence, when we consider the
$L^2$-Wasserstein geometry on $\mathcal{N}^d$, it suffices to consider the geometry of covariance matrix
variations. We write $\mathcal{N}_0^d$ for the set of all Gaussian measures with mean 0, and we denote by
$N(V)$ the Gaussian measure with mean 0 and covariance matrix $V$.

In the Riemannian length space, if a geodesic from $N(V)$ with direction $\psi$ passes through
$N(U)$, then the gradient of $\psi$ is given as a linear map associated with a symmetric matrix
depending only on $V$ and $U$. Thus the tangent space at each point can be regarded as the space
of symmetric matrices $\mathrm{Sym}(d, \mathbb{R})$. This identification coincides with the viewpoint from the
differential structure: since $\mathcal{N}_0^d$ is identified with $\mathrm{Sym}^+(d, \mathbb{R})$, which is an open subset of
$\mathrm{Sym}(d, \mathbb{R})$, we can consider the tangent space of $\mathcal{N}_0^d$ as $\mathrm{Sym}(d, \mathbb{R})$.

These observations enable us to obtain a formula for the sectional curvature of $\mathcal{N}_0^d$ by
restricting to the geodesically convex submanifold $\mathcal{N}_0^d$ of $\mathcal{P}_2^{ac}(\mathbb{R}^d)$.



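Theorem 1.1. For an orthogonal matrix $P$ and positive numbers $\{\lambda_i\}_{i=1}^d$, we set
$V = P\,\mathrm{diag}[\lambda_1, \dots, \lambda_d]\,{}^tP$, where ${}^tP$ is the transpose matrix of $P$. Then we can consider
the tangent space to $\mathcal{N}_0^d$ at $N(V)$ as spanned by

$$\left\{ e_+ = \frac{P(E_{11} + E_{dd})\,{}^tP}{\sqrt{\lambda_1 + \lambda_d}},\quad e_{ij} = \frac{P(E_{ii} - E_{jj})\,{}^tP}{\sqrt{\lambda_i + \lambda_j}},\quad f_{ij} = \frac{P(E_{ij} + E_{ji})\,{}^tP}{\sqrt{\lambda_i + \lambda_j}} \right\}_{1 \le i < j \le d},$$

where $E_{ij}$ is the $(i, j)$-matrix unit, whose $(i, j)$-component is 1 and whose other components
are 0. Then we obtain the following expressions of the sectional curvatures with respect to
these vectors:

$$K(e_+, e_{ij}) = 0 \tag{1}$$

$$K(e_+, f_{1d}) = 0 \tag{2}$$

$$K(e_+, f_{ij}) = \frac{3\lambda_i\lambda_j}{(\lambda_i + \lambda_j)^2(\lambda_1 + \lambda_d)} \quad (i = 1 \text{ or } j = d, \text{ but not both}) \tag{3}$$

$$K(e_+, f_{kl}) = 0 \quad (1 < k < l < d) \tag{4}$$

$$K(e_{ij}, e_{kl}) = 0 \tag{5}$$

$$K(e_{ij}, f_{kl}) = 0 \quad (\{i, j\} \cap \{k, l\} = \emptyset) \tag{6}$$

$$K(e_{ik}, f_{ij}) = \frac{3\lambda_i\lambda_j}{(\lambda_i + \lambda_j)^2(\lambda_i + \lambda_k)} \quad (j \ne k) \tag{7}$$

$$K(e_{ij}, f_{ij}) = \frac{12\lambda_i\lambda_j}{(\lambda_i + \lambda_j)^3} \tag{8}$$

$$K(f_{ij}, f_{kl}) = 0 \quad (\{i, j\} \cap \{k, l\} = \emptyset) \tag{9}$$

$$K(f_{ij}, f_{ik}) = \frac{3\lambda_j\lambda_k}{(\lambda_i + \lambda_j)(\lambda_j + \lambda_k)(\lambda_k + \lambda_i)} \quad (j \ne k) \tag{10}$$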



The formula coincides with the formal expression of sectional curvatures of $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ given by
Otto [13]. This formula shows that the sectional curvature of $\mathcal{N}_0^d$ is non-negative and is written
out only in terms of the eigenvalues of the covariance matrix.

The organization of the paper is as follows. We start with a review of the $L^2$-Wasserstein
geometry and the Riemannian length space in Section 2. Then we prove Theorem 1.1 in
Section 3, using an approximate expression of the sectional curvature. We demonstrate the
correspondence between our results and previously obtained results in Section 4.
2. Preliminaries



2.1. $L^2$-Wasserstein space

We first review $L^2$-Wasserstein spaces (see [16]). Given a complete metric space $(X, d)$, we
denote by $\mathcal{P}_2(X)$ the set of probability measures with finite second moments on $X$.

Definition 2.1. For $\mu, \nu \in \mathcal{P}_2(X)$, a transport plan $\pi$ between $\mu$ and $\nu$ is a Borel
probability measure on $X \times X$ with marginals $\mu$ and $\nu$, that is,

$$\pi[A \times X] = \mu[A], \quad \pi[X \times A] = \nu[A] \quad \text{for all Borel sets } A \text{ in } X.$$

Let $\Pi(\mu, \nu)$ be the set of transport plans between $\mu$ and $\nu$. Then the $L^2$-Wasserstein distance
between $\mu$ and $\nu$ is defined by

$$W_2(\mu, \nu)^2 = \inf_{\pi \in \Pi(\mu, \nu)} \int_{X \times X} d(x, y)^2 \, d\pi(x, y).$$

The $L^2$-Wasserstein distance is indeed a distance on $\mathcal{P}_2(X)$. We call the pair $(\mathcal{P}_2(X), W_2)$ the
$L^2$-Wasserstein space over $X$. A transport plan which achieves the infimum is called optimal.
Optimal transport plans on Euclidean spaces are characterized by the following properties.



Theorem 2.2 ([2], [6]). Let $\mu$ and $\nu$ be Borel probability measures on $\mathbb{R}^d$. If $\mu$ is absolutely
continuous with respect to Lebesgue measure, then

(i) there exists a convex function $\psi$ on $\mathbb{R}^d$ whose gradient $\nabla\psi$ pushes $\mu$ forward to $\nu$.

(ii) this gradient is uniquely determined ($\mu$-almost everywhere).

(iii) the joint measure $\pi = (\mathrm{id} \times \nabla\psi)_\sharp \mu$ is optimal.

(iv) $\pi$ is the only optimal measure in $\Pi(\mu, \nu)$ unless $W_2(\mu, \nu) = +\infty$.

Here the push forward measure of $\mu$ through a measurable map $f : \mathbb{R}^d \to \mathbb{R}^d$, denoted by $f_\sharp\mu$, is
defined by $f_\sharp\mu[A] = \mu[f^{-1}(A)]$ for all Borel sets $A$ in $\mathbb{R}^d$.



McCann [9] obtained the optimal transport plans between Gaussian measures on $\mathbb{R}^d$ and
showed that the displacement interpolation between any two Gaussian measures is also a
Gaussian measure. Namely, $\mathcal{N}^d$ is a geodesically convex subset of the $L^2$-Wasserstein space.



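Lemma 2.3 ([9, Example 1.7]). For $X \in \mathrm{Sym}^+(d, \mathbb{R})$, we define a symmetric positive
definite matrix $X^{1/2}$ so that $X^{1/2} \cdot X^{1/2} = X$. For $N(m, V)$ and $N(n, U)$, define a symmetric
positive definite matrix

$$W = (w_{ij}) = U^{1/2}\left(U^{1/2} V U^{1/2}\right)^{-1/2} U^{1/2}$$

and the related function

$$\mathcal{W}(x) = \frac{1}{2}\langle x - m, W(x - m)\rangle + \langle x, n\rangle.$$

We denote the gradient of $\mathcal{W}$ by $\nabla\mathcal{W}$. Then $(\mathrm{id} \times \nabla\mathcal{W})_\sharp N(m, V)$ is the optimal transport plan
between $N(m, V)$ and $N(n, U)$. If we moreover set

$$l(t) = (1 - t)m + tn, \quad W(t) = ((1 - t)E + tW)\,V\,((1 - t)E + tW),$$

where $E$ is the identity matrix, then $\{N(l(t), W(t))\}_{t \in [0,1]}$ is a geodesic from $N(m, V)$ to $N(n, U)$.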
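
The following NumPy sketch illustrates Lemma 2.3 numerically. It is only a sanity check under stated assumptions, not part of the original argument: the helper names `sym_sqrt`, `linear_transform` and `geodesic_cov` are hypothetical, and the test matrices are arbitrary. It checks that the linear map $W$ sends the covariance $V$ to $U$, that is $W V W = U$, and that the interpolated covariance $W(t)$ has the right endpoints.

```python
import numpy as np

def sym_sqrt(S):
    """Square root of a symmetric positive semidefinite matrix via eigendecomposition."""
    vals, vecs = np.linalg.eigh(S)
    return (vecs * np.sqrt(np.clip(vals, 0.0, None))) @ vecs.T

def linear_transform(V, U):
    """W = U^{1/2} (U^{1/2} V U^{1/2})^{-1/2} U^{1/2}, as in Lemma 2.3."""
    Uh = sym_sqrt(U)
    inner = sym_sqrt(Uh @ V @ Uh)
    return Uh @ np.linalg.inv(inner) @ Uh

def geodesic_cov(V, U, t):
    """Covariance W(t) = ((1-t)E + tW) V ((1-t)E + tW) of the displacement interpolation."""
    E = np.eye(V.shape[0])
    M = (1.0 - t) * E + t * linear_transform(V, U)
    return M @ V @ M

# Arbitrary symmetric positive definite test matrices (illustrative values only).
V = np.array([[2.0, 0.5], [0.5, 1.0]])
U = np.array([[1.0, -0.3], [-0.3, 3.0]])
W = linear_transform(V, U)

print(np.allclose(W @ V @ W, U))                 # x -> W x maps N(V) to N(U)
print(np.allclose(geodesic_cov(V, U, 0.0), V))   # geodesic starts at V
print(np.allclose(geodesic_cov(V, U, 1.0), U))   # geodesic ends at U
```

Computing the square root through `numpy.linalg.eigh` keeps the sketch dependent on NumPy alone; any other symmetric square root routine would serve equally well.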



Lemma 2.3 enables us to obtain the $L^2$-Wasserstein distance on $\mathcal{N}^d$.

Theorem 2.4 ([4], [7], [10], [11]). For $N(m, V)$ and $N(n, U)$, we get

$$W_2(N(m, V), N(n, U))^2 = |m - n|^2 + \mathrm{tr}\,V + \mathrm{tr}\,U - 2\,\mathrm{tr}\left(U^{1/2} V U^{1/2}\right)^{1/2}.$$

We call $W$ above the (unique) linear transform between $N(m, V)$ and $N(n, U)$. Let $O(d)$ be
the set of orthogonal matrices of size $d$. For $P \in O(d)$, we denote by $\mathcal{N}^d(P)$ the subset of $\mathcal{N}^d$
whose covariance matrices are diagonalized by $P$.
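
As a worked illustration of Theorem 2.4, the sketch below evaluates the closed-form distance with NumPy. The function names `sym_sqrt` and `w2_squared` are hypothetical helpers introduced here, and the input values are arbitrary. For commuting (here diagonal) covariances the formula reduces to a Euclidean distance between the square roots of the eigenvalues, which gives an easy hand check.

```python
import numpy as np

def sym_sqrt(S):
    """Square root of a symmetric positive semidefinite matrix via eigendecomposition."""
    vals, vecs = np.linalg.eigh(S)
    return (vecs * np.sqrt(np.clip(vals, 0.0, None))) @ vecs.T

def w2_squared(m, V, n, U):
    """Squared L^2-Wasserstein distance between N(m, V) and N(n, U) (Theorem 2.4)."""
    Uh = sym_sqrt(U)
    cross = sym_sqrt(Uh @ V @ Uh)
    return float(np.sum((m - n) ** 2) + np.trace(V) + np.trace(U) - 2.0 * np.trace(cross))

# Diagonal example: |m - n|^2 = 25 and (2 - 1)^2 + (3 - 4)^2 = 2, so the value should be 27.
m, n = np.array([0.0, 0.0]), np.array([3.0, 4.0])
V, U = np.diag([4.0, 9.0]), np.diag([1.0, 16.0])
print(w2_squared(m, V, n, U))

# Non-commuting example: the formula is symmetric in its two arguments.
V2 = np.array([[2.0, 0.5], [0.5, 1.0]])
U2 = np.array([[1.0, -0.3], [-0.3, 3.0]])
print(w2_squared(m, V2, n, U2), w2_squared(n, U2, m, V2))
```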

Corollary 2.5. For any $P \in O(d)$, $(\mathcal{N}^d(P), W_2)$ is isometric to $\mathbb{R}^d \times (\mathbb{R}_{>0})^d$.

Proof. For $N(m, V), N(n, U) \in \mathcal{N}^d(P)$, there uniquely exist $\{\lambda_i\}_{i=1}^d, \{\sigma_i\}_{i=1}^d \subset \mathbb{R}_{>0}$ such
that

$$V = P\,\mathrm{diag}[\lambda_1^2, \dots, \lambda_d^2]\,{}^tP, \quad U = P\,\mathrm{diag}[\sigma_1^2, \dots, \sigma_d^2]\,{}^tP.$$

By Theorem 2.4, we have

$$W_2(N(m, V), N(n, U))^2 = |m - n|^2 + \sum_{i=1}^d (\lambda_i - \sigma_i)^2.$$

Therefore the map identifying $N(m, V)$ with $(m, (\lambda_1, \dots, \lambda_d))$ is an isometry from $\mathcal{N}^d(P)$ to
$\mathbb{R}^d \times (\mathbb{R}_{>0})^d$. □

Remark 2.6. In the $L^2$-Wasserstein geometry, $\mathcal{N}^1$ is isometric to a Euclidean upper half
plane, whereas in the Fisher geometry, $\mathcal{N}^1$ is isometric to a hyperbolic plane with constant
sectional curvature $-1/2$ (see [1]).

2.2. Riemannian length space

Next, we give $\mathcal{N}^d$ an $L^2$-Wasserstein metric. See [3] for more detail.

Definition 2.7. Let $\langle \cdot, \cdot \rangle_y$ and $|\cdot|_y$ denote an inner product and a norm on a vector space
$\mathcal{H}_y$. A subset $M$ of a length space $(N, \mathrm{dist})$ is called Riemannian if each $x \in M$ is associated
with a map $\exp_x : \mathcal{H}_x \to N$ defined on some inner product space $\mathcal{H}_x$ which gives a surjection
from a star-shaped subset $\mathcal{K}_x \subset \mathcal{H}_x$ onto $M$ such that the curve $x_s = \exp_x(sp)$ defines an
(affinely parameterized) minimizing geodesic $[0, 1] \ni s \mapsto x_s$ linking $x = x_0$ to $y = x_1$ for each
$p \in \mathcal{K}_x$. We moreover assume that there exists $q \in \mathcal{K}_y$ such that $x_s = \exp_y((1 - s)q)$ and

$$\mathrm{dist}(\exp_x u, \exp_y v)^2 \le \mathrm{dist}(x, y)^2 - 2\langle v, q\rangle_y - 2\langle u, p\rangle_x + o\left(\sqrt{|u|_x^2 + |v|_y^2}\right)$$

for all $u \in \mathcal{H}_x$ and $v \in \mathcal{H}_y$ as $|u|_x + |v|_y \to 0$. Dependence of these structures on the base points
$x$ and $y$ may be suppressed when it can be inferred from the context.

It was shown in [3, Proposition 4.1] that $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ forms a Riemannian length space by the
following construction.

Take $(N, \mathrm{dist}) = (\mathcal{P}_2(\mathbb{R}^d), W_2)$ as our complete length space and the subset $M = \mathcal{P}_2^{ac}(\mathbb{R}^d)$.
Fix $\rho \in M$. Let $\mathrm{spt}(\rho)$ denote the smallest closed subset of $\mathbb{R}^d$ containing the full mass of $\rho$, and let
$\Omega_\rho \subset \mathbb{R}^d$ denote the interior of the convex hull of $\mathrm{spt}(\rho)$. We take
$\mathcal{H}_\rho = H^{1,2}(\mathbb{R}^d, d\rho) \subset C^{0,1}_{loc}(\Omega_\rho)$
to consist of those locally Lipschitz continuous functions on $\Omega_\rho$ whose first derivatives lie in
the weighted space $L^2(\mathbb{R}^d, d\rho; \mathbb{R}^d)$, modulo equivalence with respect to the semi-norm

$$|\psi|_\rho^2 = \int_{\mathbb{R}^d} |\nabla\psi(x)|^2 \, d\rho(x).$$

The exponential map is defined by

$$\exp_\rho s\psi = [\mathrm{id} + s\nabla\psi]_\sharp \rho.$$

Furthermore, they remarked that if $M' \subset M$ is geodesically convex, meaning any geodesic lies
in $M'$ whenever its endpoints do, then $M'$ is itself a Riemannian length space with the same
tangent space and the same exponential map as those of $M$, but the star-shaped subset is given by

$$\mathcal{K}'_x = \{p \in \mathcal{K}_x \mid \exp_x p \in M'\}.$$

Lemma 2.3 implies that $\mathcal{N}_0^d$ is a geodesically convex subset of $\mathcal{P}_2^{ac}(\mathbb{R}^d)$. Therefore $\mathcal{N}_0^d$ should
also be a Riemannian length space, and in fact a Riemannian manifold. If $W$ is the linear
transform between $N(V)$ and $N(U)$, then $\exp_{N(V)} \psi = N(U)$ if and only if $\nabla\psi$ is the same as
$\nabla(\mathcal{W} - |\cdot|^2/2)$. Identifying linear transforms with their coefficient matrices, we treat the
tangent vector space at each point as $\mathrm{Sym}(d, \mathbb{R})$. Namely, we can identify the tangent space at
$N(V)$ to $\mathcal{N}_0^d$ with $\mathrm{Sym}(d, \mathbb{R})$ based on the idea

$$\exp_{N(V)} tX = N(U(t)), \quad \text{where } U(t) = ((1 - t)E + tX)\,V\,((1 - t)E + tX).$$

Moreover, the inner product becomes a Riemannian metric $g$ on $\mathcal{N}_0^d$, whose Riemannian
distance coincides with the $L^2$-Wasserstein distance. The expression of $g$ is given by

$$g_{N(V)}(tX, tX) = \int_{\mathbb{R}^d} |tXx|^2 \, dN(V)(x) = t^2\,\mathrm{tr}\,XVX.$$



Theorem 2.8. Let $\mathcal{N}_0^d$ be the space of Gaussian measures with mean 0 over $\mathbb{R}^d$. Then $\mathcal{N}_0^d$
becomes a $C^\infty$-Riemannian manifold of dimension $d(d + 1)/2$ and there exists the
$L^2$-Wasserstein metric $g$. If we identify the tangent space at $N(V)$ to $\mathcal{N}_0^d$ with $\mathrm{Sym}(d, \mathbb{R})$ by

$$\exp_{N(V)} tX = N(U(t)), \quad \text{where } U(t) = ((1 - t)E + tX)\,V\,((1 - t)E + tX),$$

then it is explicitly given by

$$g_{N(V)}(X, Y) = \mathrm{tr}\,XVY.$$

Theorem 2.8 shows that $\{e_+, e_{ij}, f_{ij}\}_{1 \le i < j \le d}$ is a set of unit vectors.
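
The unit-length claim can be checked numerically from $g_{N(V)}(X, Y) = \mathrm{tr}\,XVY$. The sketch below is an illustration only: the helper names `matrix_unit`, `metric` and `frame` are hypothetical, indices are 0-based in the code, and the eigenvalues and the orthogonal matrix $P$ are arbitrary test data.

```python
import numpy as np

def matrix_unit(i, j, d):
    """Matrix unit E_ij of size d (0-based indices)."""
    E = np.zeros((d, d))
    E[i, j] = 1.0
    return E

def metric(V, X, Y):
    """L^2-Wasserstein metric g_{N(V)}(X, Y) = tr(X V Y) from Theorem 2.8."""
    return float(np.trace(X @ V @ Y))

def frame(lams, P):
    """Return e_plus and dictionaries e[(i, j)], f[(i, j)] for i < j, as in Theorem 1.1."""
    d = len(lams)
    conj = lambda A: P @ A @ P.T
    e_plus = conj(matrix_unit(0, 0, d) + matrix_unit(d - 1, d - 1, d)) / np.sqrt(lams[0] + lams[-1])
    e, f = {}, {}
    for i in range(d):
        for j in range(i + 1, d):
            s = np.sqrt(lams[i] + lams[j])
            e[(i, j)] = conj(matrix_unit(i, i, d) - matrix_unit(j, j, d)) / s
            f[(i, j)] = conj(matrix_unit(i, j, d) + matrix_unit(j, i, d)) / s
    return e_plus, e, f

lams = np.array([0.5, 1.0, 2.5])                     # arbitrary positive eigenvalues
P, _ = np.linalg.qr(np.random.default_rng(0).standard_normal((3, 3)))  # some orthogonal matrix
V = P @ np.diag(lams) @ P.T
e_plus, e, f = frame(lams, P)

norms = [metric(V, e_plus, e_plus)]
norms += [metric(V, X, X) for X in list(e.values()) + list(f.values())]
print(np.allclose(norms, 1.0))                       # every listed vector has unit length
print(abs(metric(V, e[(0, 1)], f[(0, 1)])) < 1e-12)  # e_ij and f_ij are orthogonal
```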

3. Proof of Theorem 1.1

In order to calculate the sectional curvatures of $(\mathcal{N}_0^d, g)$, where $g$ is the $L^2$-Wasserstein metric,
we need some lemmas.

Lemma 3.1 ([5, Theorem 3.68]). Let $(M, g)$ be a Riemannian manifold. For any $p \in M$, let
$\{u, v\}$ be an orthonormal basis of a 2-plane in the tangent space at $p$. Let

$$C_r(\theta) = \exp_p r(u\cos\theta + v\sin\theta),$$

and let $L(r)$ be the length of the curve $C_r$. Then the function $L(r)$ admits the asymptotic expansion

$$L(r) = 2\pi r\left(1 - \frac{K(u, v)}{6}r^2 + o(r^2)\right) \quad \text{as } r \searrow 0,$$

where $K(u, v)$ is the sectional curvature of the 2-plane spanned by $\{u, v\}$.
Lemma 3.2. For $A, B \in \{e_+, e_{ij}, f_{ij}\}_{1 \le i < j \le d}$, $0 < r < 1$ and $\theta \in [0, 2\pi]$,

$$C_r(\theta) = \exp_{N(V)} r(\cos\theta \cdot A + \sin\theta \cdot B)$$

is a Gaussian measure whose covariance matrix $X = X(r, \theta) = (x_{\alpha\beta})$ is given by

$$X = [E + r(\cos\theta \cdot A + \sin\theta \cdot B)] \cdot V \cdot [E + r(\cos\theta \cdot A + \sin\theta \cdot B)], \tag{3.1}$$

where $E$ is the identity matrix.

Proof. It is clear by Lemma 2.3. □
Proof of (1) and (5)

If we choose $A = e_+$, $B = e_{ij}$ or $A = e_{ij}$, $B = e_{kl}$ in (3.1), then $N(X)$ belongs to $\mathcal{N}_0^d(P)$. Since
$\mathcal{N}_0^d(P)$ is a flat manifold by Corollary 2.5, the curvatures vanish.

A strategy for proving the remaining cases is as follows. We first calculate

$$W(\theta_0, \theta) = W_2(C_r(\theta_0), C_r(\theta))^2 \quad \text{and} \quad w(\theta_0) = \lim_{\theta \to \theta_0} \frac{W(\theta_0, \theta)}{(\theta - \theta_0)^2}.$$

Then we get

$$L(r) = \int_0^{2\pi} w(\theta)^{1/2} \, d\theta.$$

Finally we use Lemma 3.1 to obtain the expression of the sectional curvatures. Without loss
of generality, we may assume $P = E$. That is to say,

$$e_+ = \frac{E_{11} + E_{dd}}{\sqrt{\lambda_1 + \lambda_d}}, \quad e_{ij} = \frac{E_{ii} - E_{jj}}{\sqrt{\lambda_i + \lambda_j}}, \quad f_{ij} = \frac{E_{ij} + E_{ji}}{\sqrt{\lambda_i + \lambda_j}}, \quad \text{and} \quad V = \mathrm{diag}[\lambda_1, \dots, \lambda_d],$$

because we have

$$W(\theta_0, \theta) = \mathrm{tr}\,X(r, \theta_0) + \mathrm{tr}\,X(r, \theta) - 2\,\mathrm{tr}\left(X(r, \theta_0)^{1/2} X(r, \theta) X(r, \theta_0)^{1/2}\right)^{1/2}$$

and the value is invariant under conjugation with any orthogonal matrix $P$.

For a general symmetric positive definite matrix $X$, it is hard to get a concrete expression
of $X^{1/2}$. But if the matrix is of size $2 \times 2$, the next lemma enables us to obtain the value of
the trace and the determinant of $X^{1/2}$.



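Lemma 3.3. Let $M \in \mathrm{Sym}(2, \mathbb{R})$. Then

$$(\mathrm{tr}\,M)^2 = \mathrm{tr}\,M^2 + 2\det M.$$

Proof. Setting

$$M = \begin{pmatrix} a & c \\ c & b \end{pmatrix}, \quad \text{we obtain} \quad M^2 = \begin{pmatrix} a^2 + c^2 & c(a + b) \\ c(a + b) & b^2 + c^2 \end{pmatrix}.$$

Therefore, we get

$$\mathrm{tr}\,M^2 + 2\det M = a^2 + b^2 + 2c^2 + 2(ab - c^2) = (a + b)^2 = (\mathrm{tr}\,M)^2.$$

□

We set

$$c_{ij}(r, \theta) = \frac{r\cos\theta}{\sqrt{\lambda_i + \lambda_j}}, \quad s_{ij}(r, \theta) = \frac{r\sin\theta}{\sqrt{\lambda_i + \lambda_j}}$$

for $1 \le i, j \le d$, $\theta \in [0, 2\pi]$ and sufficiently small $r > 0$.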
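
To see how Lemma 3.3 is used, apply it to $M = X^{1/2}$ for a $2 \times 2$ symmetric positive definite $X$: this gives $(\mathrm{tr}\,X^{1/2})^2 = \mathrm{tr}\,X + 2\sqrt{\det X}$, so the trace of the square root is available without computing the square root itself. The NumPy sketch below checks both statements on an arbitrary example; the function names are hypothetical helpers, not notation from the text.

```python
import numpy as np

def trace_identity_gap(M):
    """(tr M)^2 - (tr M^2 + 2 det M); Lemma 3.3 says this vanishes for 2x2 symmetric M."""
    return float(np.trace(M) ** 2 - (np.trace(M @ M) + 2.0 * np.linalg.det(M)))

def trace_sqrt_2x2(X):
    """tr X^{1/2} for 2x2 symmetric positive definite X, via Lemma 3.3 applied to X^{1/2}."""
    return float(np.sqrt(np.trace(X) + 2.0 * np.sqrt(np.linalg.det(X))))

X = np.array([[2.0, 0.5], [0.5, 1.0]])            # arbitrary symmetric positive definite matrix
direct = float(np.sum(np.sqrt(np.linalg.eigvalsh(X))))  # tr X^{1/2} from the eigenvalues

print(trace_identity_gap(X))        # zero up to rounding
print(trace_sqrt_2x2(X), direct)    # the two values agree
```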
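Proof of (2) and (8)

For (2), we take $A = e_+$, $B = f_{1d}$ and $I = \{1, d\}$, whereas, for (8), we take $A = e_{ij}$, $B = f_{ij}$ and
$I = \{i, j\}$. Then we notice that the $(\alpha, \beta)$-components of $X$ with $(\alpha, \beta) \notin I \times I$ are independent of
the variables $r$ and $\theta$. If we denote again by $X(\theta)$ the $2 \times 2$ block

$$X(\theta) = \begin{pmatrix} x_{\alpha\alpha} & x_{\alpha\beta} \\ x_{\beta\alpha} & x_{\beta\beta} \end{pmatrix}$$

for $\{\alpha, \beta\} = I$, we obtain

$$W(\theta_0, \theta) = \mathrm{tr}\,X(\theta_0) + \mathrm{tr}\,X(\theta) - 2\,\mathrm{tr}\left(X(\theta_0)^{1/2} X(\theta) X(\theta_0)^{1/2}\right)^{1/2}. \tag{3.2}$$

For (2), using Lemma 3.3 we conclude

$$W(\theta_0, \theta) = 4r^2\sin^2\frac{\theta - \theta_0}{2} \quad \text{and} \quad \lim_{\theta \to \theta_0} \frac{W(\theta_0, \theta)}{(\theta - \theta_0)^2} = r^2.$$

It follows that $L(r) = 2\pi r$, proving $K(e_+, f_{1d}) = 0$.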
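For (8), in a similar way, we have

$$W(\theta_0, \theta) = 4r^2\sin^2\frac{\theta - \theta_0}{2} - \frac{4r^4\lambda_i\lambda_j\sin^2(\theta - \theta_0)}{(\lambda_i + \lambda_j)^2 a_r(\theta_0, \theta)} + o(|\theta - \theta_0|^2),$$

where

$$a_r(\theta_0, \theta) = \lambda_i\left[(1 + c_{ij}(r, \theta_0))(1 + c_{ij}(r, \theta)) + s_{ij}(r, \theta_0)s_{ij}(r, \theta)\right] + \lambda_j\left[(1 - c_{ij}(r, \theta_0))(1 - c_{ij}(r, \theta)) + s_{ij}(r, \theta_0)s_{ij}(r, \theta)\right].$$

Since the limit of $a_r(\theta_0, \theta)$ exists as $\theta \to \theta_0$ and

$$a_r(\theta_0, \theta_0) = (\lambda_i + \lambda_j) + r^2 + 2(\lambda_i - \lambda_j)c_{ij}(r, \theta_0),$$

we have

$$\lim_{\theta \to \theta_0} \frac{W(\theta_0, \theta)}{(\theta - \theta_0)^2} = r^2 - \frac{4r^4\lambda_i\lambda_j}{(\lambda_i + \lambda_j)^2 a_r(\theta_0, \theta_0)}.$$

It follows that

$$L(r) = \int_0^{2\pi} \sqrt{r^2 - \frac{4r^4\lambda_i\lambda_j}{(\lambda_i + \lambda_j)^2 a_r(\theta, \theta)}} \, d\theta = \int_0^{2\pi} r\left(1 - \frac{2r^2\lambda_i\lambda_j}{(\lambda_i + \lambda_j)^2 a_r(\theta, \theta)} + o(r^2)\right) d\theta.$$

Because $a_0(\theta, \theta) = \lambda_i + \lambda_j$, using Lemma 3.1 and the bounded convergence theorem, we obtain

$$K(e_{ij}, f_{ij}) = \frac{12\lambda_i\lambda_j}{(\lambda_i + \lambda_j)^3}.$$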
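Proof of (3) and (7)

For (3), assuming $i = 1$, we take $A = e_+$, $B = f_{1j}$ and $I = \{1, j, d\}$, whereas, for (7), assuming
$j < k$, we take $A = e_{ik}$, $B = f_{ij}$ and $I = \{i, j, k\}$. Since the $(\alpha, \beta)$-components of $X$ with
$(\alpha, \beta) \notin I \times I$ are independent of the variables $r$ and $\theta$, we obtain

$$W(\theta_0, \theta) = \mathrm{tr}\,X(\theta_0) + \mathrm{tr}\,X(\theta) - 2\,\mathrm{tr}\left(X(\theta_0)^{1/2} X(\theta) X(\theta_0)^{1/2}\right)^{1/2}$$

$$= \mathrm{tr}\,Y(\theta_0) + \mathrm{tr}\,Y(\theta) - 2\,\mathrm{tr}\left(Y(\theta_0)^{1/2} Y(\theta) Y(\theta_0)^{1/2}\right)^{1/2} + \frac{\lambda_\gamma r^2}{\lambda_\alpha + \lambda_\gamma}(\cos\theta - \cos\theta_0)^2,$$

where

$$Y(\theta) = \begin{pmatrix} x_{\alpha\alpha} & x_{\alpha\beta} \\ x_{\beta\alpha} & x_{\beta\beta} \end{pmatrix}$$

and $\{\alpha, \beta, \gamma\} = I$, with $\alpha$ the index shared by $A$ and $B$, $\beta$ the other index of $B$ and $\gamma$ the other
index of $A$. Using Lemma 3.3 we conclude

$$W(\theta_0, \theta) = 4r^2\sin^2\frac{\theta - \theta_0}{2} - \frac{r^4\lambda_\alpha\lambda_\beta\sin^2(\theta - \theta_0)}{a_r(\theta, \theta_0)(\lambda_\alpha + \lambda_\beta)(\lambda_\alpha + \lambda_\gamma)} + o(|\theta - \theta_0|^2),$$

where

$$a_r(\theta, \theta_0) = \lambda_\alpha(1 + c_{\alpha\gamma}(r, \theta_0))(1 + c_{\alpha\gamma}(r, \theta)) + r^2\sin\theta_0\sin\theta + \lambda_\beta.$$

Since the limit of $a_r(\theta_0, \theta)$ exists as $\theta \to \theta_0$, we have

$$\lim_{\theta \to \theta_0} \frac{W(\theta_0, \theta)}{(\theta - \theta_0)^2} = r^2 - \frac{r^4\lambda_\alpha\lambda_\beta}{a_r(\theta_0, \theta_0)(\lambda_\alpha + \lambda_\beta)(\lambda_\alpha + \lambda_\gamma)}.$$

It follows that

$$L(r) = \int_0^{2\pi} r\left(1 - \frac{r^2\lambda_\alpha\lambda_\beta}{2a_r(\theta, \theta)(\lambda_\alpha + \lambda_\beta)(\lambda_\alpha + \lambda_\gamma)} + o(r^2)\right) d\theta.$$

Because $a_0(\theta, \theta) = \lambda_\alpha + \lambda_\beta$, using Lemma 3.1 and the bounded convergence theorem, we
obtain

$$K(A, B) = \frac{3\lambda_\alpha\lambda_\beta}{(\lambda_\alpha + \lambda_\beta)^2(\lambda_\alpha + \lambda_\gamma)}.$$

We can prove the case of $i \ne 1$ and $j = d$ in a similar way.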
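Proof of (4), (6) and (9)

We take $(A, B)$ in (3.1) as $(e_+, f_{kl})$ ($\{1, d\} \cap \{k, l\} = \emptyset$), $(e_{ij}, f_{kl})$ ($\{i, j\} \cap \{k, l\} = \emptyset$) and
$(f_{ij}, f_{kl})$ ($\{i, j\} \cap \{k, l\} = \emptyset$) in this order. Moreover we set $I = \{1, d\}$ in the case of (4) and
$I = \{i, j\}$ in the cases of (6) and (9). We notice that the $(\alpha, \beta)$-components of $X$ with $(\alpha, \beta)$
outside $(I \times I) \cup (\{k, l\} \times \{k, l\})$ are independent of the variables $r$ and $\theta$. If we set

$$X_c = \begin{pmatrix} x_{\alpha\alpha} & x_{\alpha\beta} \\ x_{\beta\alpha} & x_{\beta\beta} \end{pmatrix}, \quad X_s = \begin{pmatrix} x_{kk} & x_{kl} \\ x_{lk} & x_{ll} \end{pmatrix},$$

we obtain

$$W(\theta_0, \theta) = \mathrm{tr}\,X_c(\theta_0) + \mathrm{tr}\,X_c(\theta) - 2\,\mathrm{tr}\left(X_c(\theta_0)^{1/2} X_c(\theta) X_c(\theta_0)^{1/2}\right)^{1/2} + \mathrm{tr}\,X_s(\theta_0) + \mathrm{tr}\,X_s(\theta) - 2\,\mathrm{tr}\left(X_s(\theta_0)^{1/2} X_s(\theta) X_s(\theta_0)^{1/2}\right)^{1/2}, \tag{3.3}$$

where $\{\alpha, \beta\} = I$. Using Lemma 3.3 we conclude

$$\lim_{\theta \to \theta_0} \frac{W(\theta_0, \theta)}{(\theta - \theta_0)^2} = r^2 \lim_{\theta \to \theta_0} \left(\frac{2\sin\frac{\theta - \theta_0}{2}}{\theta - \theta_0}\right)^2 = r^2.$$

It follows that $L(r) = 2\pi r$ and $K(A, B) = 0$.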
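Proof of (10)

Without loss of generality, we may assume $j < k$. We take $A$ and $B$ as $f_{ij}$ and $f_{ik}$ in (3.1),
respectively. We notice that the $(\alpha, \beta)$-components of $X$ with $\alpha, \beta$ not both in $\{i, j, k\}$ are
independent of the variables $r$ and $\theta$. If we set

$$X(\theta) = \begin{pmatrix} x_{ii} & x_{ij} & x_{ik} \\ x_{ji} & x_{jj} & x_{jk} \\ x_{ki} & x_{kj} & x_{kk} \end{pmatrix} = \begin{pmatrix} \lambda_i + \lambda_j c_{ij}(r, \theta)^2 + \lambda_k s_{ik}(r, \theta)^2 & (\lambda_i + \lambda_j)c_{ij}(r, \theta) & (\lambda_i + \lambda_k)s_{ik}(r, \theta) \\ (\lambda_i + \lambda_j)c_{ij}(r, \theta) & \lambda_j + \lambda_i c_{ij}(r, \theta)^2 & \lambda_i c_{ij}(r, \theta)s_{ik}(r, \theta) \\ (\lambda_i + \lambda_k)s_{ik}(r, \theta) & \lambda_i c_{ij}(r, \theta)s_{ik}(r, \theta) & \lambda_k + \lambda_i s_{ik}(r, \theta)^2 \end{pmatrix},$$

we obtain

$$W(\theta_0, \theta) = \mathrm{tr}\,X(\theta_0) + \mathrm{tr}\,X(\theta) - 2\,\mathrm{tr}\left(X(\theta_0)^{1/2} X(\theta) X(\theta_0)^{1/2}\right)^{1/2}. \tag{3.4}$$

For the value of the last term in (3.4), Lemma 3.3 cannot be used as the size of the matrices is
$3 \times 3$.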

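We introduce some notation:

$$A = A_{\theta_0}(\theta) = X(\theta_0)^{1/2} X(\theta) X(\theta_0)^{1/2},$$
$$B = B_{\theta_0}(\theta) = \left(X(\theta_0)^{1/2} X(\theta) X(\theta_0)^{1/2}\right)^{1/2},$$
$$\{\sigma_a = \sigma_{\theta_0}(\theta)_a\}_{a=1}^3 : \text{eigenvalues of } B,$$
$$f_{\theta_0}(\theta) = \mathrm{tr}\,B = \sigma_1 + \sigma_2 + \sigma_3,$$
$$g_{\theta_0}(\theta) = \mathrm{tr}\,A = \sigma_1^2 + \sigma_2^2 + \sigma_3^2,$$
$$h_{\theta_0}(\theta) = \sigma_1\sigma_2 + \sigma_2\sigma_3 + \sigma_3\sigma_1,$$
$$\varphi_{\theta_0}(\theta) = \sigma_1^2\sigma_2^2 + \sigma_2^2\sigma_3^2 + \sigma_3^2\sigma_1^2,$$
$$D_{\theta_0}(\theta) = \det B = (\det A)^{1/2} = \sigma_1\sigma_2\sigma_3.$$

Rewriting (3.4) with the Taylor approximation of $f_{\theta_0}(\cdot)$ at $\theta_0$, we obtain

$$W(\theta_0, \theta) = -2f'_{\theta_0}(\theta_0)(\theta - \theta_0) - f''_{\theta_0}(\theta_0)(\theta - \theta_0)^2 + o(|\theta - \theta_0|^2).$$

Since we can get the values of $g$, $\varphi$ and $D$ without information on $X^{1/2}$, we compute $f'$ and $f''$
by using these values.

We calculate $f'_{\theta_0}(\theta_0)$ first. Differentiating $B_{\theta_0}(\theta) \cdot B_{\theta_0}(\theta) = A_{\theta_0}(\theta)$ with respect to $\theta$, we have

$$B'_{\theta_0}(\theta)B_{\theta_0}(\theta) + B_{\theta_0}(\theta)B'_{\theta_0}(\theta) = A'_{\theta_0}(\theta) = X(\theta_0)^{1/2} X'(\theta) X(\theta_0)^{1/2}.$$

After multiplying by $B_{\theta_0}(\theta)^{-1}$ from the right, taking the trace gives

$$\mathrm{tr}\,B'_{\theta_0}(\theta_0) + \mathrm{tr}\left(B_{\theta_0}(\theta_0)B'_{\theta_0}(\theta_0)B_{\theta_0}(\theta_0)^{-1}\right) = 2f'_{\theta_0}(\theta_0)$$

at $\theta = \theta_0$. Because $B_{\theta_0}(\theta_0) = X(\theta_0)$ and $\mathrm{tr}\,X(\theta)$ is constant, at $\theta = \theta_0$ the right hand side is equal to

$$\mathrm{tr}\left(X(\theta_0)^{1/2} X'(\theta_0) X(\theta_0)^{1/2} X(\theta_0)^{-1}\right) = \mathrm{tr}\,X'(\theta_0) = (\mathrm{tr}\,X(\theta))'\big|_{\theta = \theta_0} = 0.$$

Therefore we conclude

$$f'_{\theta_0}(\theta_0) = 0. \tag{3.5}$$

Next we compute $f''_{\theta_0}(\theta_0)$. Differentiating $f^2 = g + 2h$ at $\theta = \theta_0$, we have

$$2f_{\theta_0}(\theta_0)f'_{\theta_0}(\theta_0) = g'_{\theta_0}(\theta_0) + 2h'_{\theta_0}(\theta_0),$$

proving

$$2h'_{\theta_0}(\theta_0) = -g'_{\theta_0}(\theta_0)$$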
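because of (3.5). Differentiating once more, we have

$$2(f'_{\theta_0}(\theta))^2 + 2f_{\theta_0}(\theta)f''_{\theta_0}(\theta) = g''_{\theta_0}(\theta) + 2h''_{\theta_0}(\theta).$$

Because of (3.5), we get at $\theta = \theta_0$

$$f''_{\theta_0}(\theta_0) = \frac{g''_{\theta_0}(\theta_0) + 2h''_{\theta_0}(\theta_0)}{2f_{\theta_0}(\theta_0)}. \tag{3.6}$$

We compute directly

$$g_{\theta_0}(\theta) = \mathrm{tr}\left(X(\theta_0)X(\theta)\right) = \sum_{\alpha, \beta \in \{i, j, k\}} x_{\alpha\beta}(\theta_0)x_{\alpha\beta}(\theta).$$

This enables us to get the derivatives of $g_{\theta_0}(\theta)$. Because $B_{\theta_0}(\theta_0) = X(\theta_0)$, using the relation

$$\det(tE - B) = t^3 - t^2 \cdot f + t \cdot h - D,$$

we have

$$h_{\theta_0}(\theta_0) = \sum_{\alpha, \beta \in \{i, j, k\},\ \alpha < \beta} \left(x_{\alpha\alpha}(\theta_0)x_{\beta\beta}(\theta_0) - x_{\alpha\beta}(\theta_0)^2\right).$$

Since it is hard to compute $B_{\theta_0}(\theta)$ directly, it is also hard to know the value of $h_{\theta_0}(\theta)$. We want
to derive $h''_{\theta_0}(\theta_0)$ without information on $B_{\theta_0}(\theta)$. So differentiating $h^2 = \varphi + 2Df$ twice, we
have

$$2(h'_{\theta_0}(\theta))^2 + 2h_{\theta_0}(\theta)h''_{\theta_0}(\theta) = \varphi''_{\theta_0}(\theta) + 4D'_{\theta_0}(\theta)f'_{\theta_0}(\theta) + 2D''_{\theta_0}(\theta)f_{\theta_0}(\theta) + 2D_{\theta_0}(\theta)f''_{\theta_0}(\theta).$$

At $\theta = \theta_0$, using (3.5) and $2h'_{\theta_0}(\theta_0) = -g'_{\theta_0}(\theta_0)$, we have

$$h''_{\theta_0}(\theta_0) = \frac{2\varphi''_{\theta_0}(\theta_0) - g'_{\theta_0}(\theta_0)^2}{4h_{\theta_0}(\theta_0)} + \frac{2D''_{\theta_0}(\theta_0)f_{\theta_0}(\theta_0) + 2D_{\theta_0}(\theta_0)f''_{\theta_0}(\theta_0)}{2h_{\theta_0}(\theta_0)}. \tag{3.7}$$

In order to analyze (3.7), we consider $D_{\theta_0}(\theta)$ and $\varphi_{\theta_0}(\theta)$. From the definition, we can compute
$D_{\theta_0}(\theta)$ directly as

$$D_{\theta_0}(\theta) = \lambda_i\lambda_j\lambda_k\left[1 - \left(c_{ij}(r, \theta_0)^2 + s_{ik}(r, \theta_0)^2\right)\right]\left[1 - \left(c_{ij}(r, \theta)^2 + s_{ik}(r, \theta)^2\right)\right].$$

We next consider $\varphi_{\theta_0}(\theta)$. Using the equation

$$\det(tE - A) = \det X(\theta_0) \cdot \det\left(tX(\theta_0)^{-1} - X(\theta)\right)$$

and the relation

$$\det(tE - A) = t^3 - t^2 \cdot g(\theta) + t \cdot \varphi - D^2,$$

we conclude

$$\varphi_{\theta_0}(\theta) = \det\left(X(\theta_0)X(\theta)\right) \cdot \mathrm{tr}\left(Y(\theta_0)Y(\theta)\right), \quad \text{where } Y(\theta) = X(\theta)^{-1}. \tag{3.8}$$

Since (3.8) depends only on $X(\theta)$, we can obtain the value of $\varphi_{\theta_0}(\theta)$. Therefore we can now
specify the value of $h''_{\theta_0}(\theta_0)$ in (3.7).
Inserting (3.7) into (3.6), we obtain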
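$$f''_{\theta_0}(\theta_0) = \frac{\beta_r(\theta_0)}{\alpha_r(\theta_0)},$$

where

$$\alpha_r(\theta_0) = 2\left[f_{\theta_0}(\theta_0)h_{\theta_0}(\theta_0) - D_{\theta_0}(\theta_0)\right] \tag{3.9}$$
$$= 2(\lambda_i + \lambda_j)(\lambda_i + \lambda_k)(\lambda_k + \lambda_j) + r^2\left[\lambda_j^2 + \lambda_k^2 + 4\lambda_j\lambda_k + \lambda_i\lambda_j + \lambda_i\lambda_k + (\lambda_k - \lambda_j)(\lambda_j + \lambda_k + 3\lambda_i)\cos 2\theta_0\right] + O(r^4),$$

$$\beta_r(\theta_0) = h_{\theta_0}(\theta_0)g''_{\theta_0}(\theta_0) - \frac{1}{2}g'_{\theta_0}(\theta_0)^2 + \varphi''_{\theta_0}(\theta_0) + 2f_{\theta_0}(\theta_0)D''_{\theta_0}(\theta_0) \tag{3.10}$$
$$= -2r^2(\lambda_i + \lambda_j)(\lambda_i + \lambda_k)(\lambda_k + \lambda_j) - r^4\left[(\lambda_j + \lambda_k)(\lambda_i + \lambda_j + \lambda_k) + (\lambda_k - \lambda_j)(\lambda_j + \lambda_k + 3\lambda_i)\cos 2\theta_0\right] + O(r^6).$$

Therefore we have

$$w(\theta_0) = \lim_{\theta \to \theta_0} \frac{W(\theta_0, \theta)}{(\theta - \theta_0)^2} = -\frac{\beta_r(\theta_0)}{\alpha_r(\theta_0)}.$$

If we set

$$\ell = 2(\lambda_i + \lambda_j)(\lambda_i + \lambda_k)(\lambda_k + \lambda_j),$$
$$a = \lambda_j^2 + \lambda_k^2 + 4\lambda_j\lambda_k + \lambda_i\lambda_j + \lambda_i\lambda_k + (\lambda_k - \lambda_j)(\lambda_j + \lambda_k + 3\lambda_i)\cos 2\theta,$$
$$b = (\lambda_j + \lambda_k)(\lambda_i + \lambda_j + \lambda_k) + (\lambda_k - \lambda_j)(\lambda_j + \lambda_k + 3\lambda_i)\cos 2\theta,$$

so that $a - b = 2\lambda_j\lambda_k$, we have

$$L(r) = \int_0^{2\pi} r\left[1 - \frac{r^2(a - b)}{2(\ell + r^2 a)} + o(r^2)\right] d\theta.$$

Using Lemma 3.1 and the bounded convergence theorem, we obtain

$$2\pi\frac{K(f_{ij}, f_{ik})}{6} = \lim_{r \searrow 0} \int_0^{2\pi} \frac{a - b}{2(\ell + r^2 a)} \, d\theta = 2\pi\frac{a - b}{2\ell},$$

which implies that

$$K(f_{ij}, f_{ik}) = \frac{3\lambda_j\lambda_k}{(\lambda_i + \lambda_j)(\lambda_i + \lambda_k)(\lambda_k + \lambda_j)}.$$

This completes the proof of Theorem 1.1.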



4. Remarks on Theorem 1.1

In this section we consider the case $d = 2$ in particular.

4.1. Geometric interpretations of Theorem 1.1

Lemma 4.1. Any $V = (v_{ij}) \in \mathrm{Sym}(2, \mathbb{R})$ is diagonalized by some special orthogonal matrix.
In other words, there exists some $\theta \in \mathbb{R}$ such that the rotation matrix

$$R(\theta) = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}$$

diagonalizes $V$.

Proof. In the case of $v_{12} = 0$, we set $\theta = 0$. In the case of $v_{12} \ne 0$, since

$${}^tR(\theta)VR(\theta) = \begin{pmatrix} v_{11}\cos^2\theta + v_{12}\sin 2\theta + v_{22}\sin^2\theta & v_{12}\cos 2\theta - 2^{-1}(v_{11} - v_{22})\sin 2\theta \\ v_{12}\cos 2\theta - 2^{-1}(v_{11} - v_{22})\sin 2\theta & v_{22}\cos^2\theta - v_{12}\sin 2\theta + v_{11}\sin^2\theta \end{pmatrix},$$

${}^tR(\theta)VR(\theta)$ is a diagonal matrix if and only if

$$v_{12}\cos 2\theta = 2^{-1}(v_{11} - v_{22})\sin 2\theta \iff \cot 2\theta = (v_{11} - v_{22})/2v_{12}. \tag{4.1}$$

Because $\cot 2\theta$ can take any real value, (4.1) always has a solution. □
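
The proof of Lemma 4.1 is constructive, and a short NumPy sketch makes the recipe concrete. The function names below are hypothetical; the angle is obtained from `numpy.arctan2`, which is one convenient way to solve $\cot 2\theta = (v_{11} - v_{22})/2v_{12}$ and also covers the case $v_{12} = 0$. The test matrix is arbitrary and need not be positive definite.

```python
import numpy as np

def rotation(theta):
    """Rotation matrix R(theta) as in Lemma 4.1."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def diagonalizing_angle(V):
    """An angle theta with cot(2 theta) = (v11 - v22) / (2 v12); theta = 0 if v12 = 0."""
    v11, v12, v22 = V[0, 0], V[0, 1], V[1, 1]
    if v12 == 0.0:
        return 0.0
    return 0.5 * np.arctan2(2.0 * v12, v11 - v22)

V = np.array([[2.0, 0.7], [0.7, -1.0]])   # arbitrary symmetric test matrix
theta = diagonalizing_angle(V)
R = rotation(theta)
D = R.T @ V @ R

print(theta)
print(D)   # off-diagonal entries vanish up to rounding
```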
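For $\alpha, \beta > 0$, we denote

$$(\alpha, \beta; \theta) = N\left(R(\theta)\begin{pmatrix} \alpha^2 & 0 \\ 0 & \beta^2 \end{pmatrix}{}^tR(\theta)\right).$$

We abbreviate $\mathcal{N}_0^2(R(\theta))$ as $\mathcal{N}_0^2(\theta)$. We also set $\Lambda = \{(\lambda, \lambda; \theta) \mid \lambda > 0\}$. Since
$(\alpha, \beta; \pi/2 + \theta) = (\beta, \alpha; \theta)$, the expression $(\alpha, \beta; \theta)$ is not a global coordinate system. Even if we
consider $\theta$ modulo $\pi/2$, the diagonalizing matrix is not unique if $\alpha$ is equal to $\beta$.

Throughout this section, we fix $\rho = (\alpha, \beta; 0)$. We regard Gaussian measures $(\alpha, \beta; \theta)$ as
ellipsoids: $(\alpha, \beta)$ specifies the lengths of the axes and $\theta$ the angle of the major and minor axes.
Let $X$, $Y$ and $Z$ be matrices defined by

$$X = \frac{1}{\gamma}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \quad Y = \frac{1}{\gamma}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad Z = \frac{1}{\gamma}\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \tag{4.2}$$

where $\gamma = (\alpha^2 + \beta^2)^{1/2}$. Using this expression, we get

$$\exp_\rho rX = N(U), \quad U = \frac{1}{\gamma^2}\begin{pmatrix} \alpha^2(\gamma + r)^2 & 0 \\ 0 & \beta^2(\gamma - r)^2 \end{pmatrix},$$

$$\exp_\rho rY = N(V), \quad V = \frac{1}{\gamma^2}\begin{pmatrix} \alpha^2\gamma^2 + \beta^2 r^2 & \gamma^3 r \\ \gamma^3 r & \alpha^2 r^2 + \beta^2\gamma^2 \end{pmatrix},$$

$$\exp_\rho rZ = N(W), \quad W = \frac{1}{\gamma^2}\begin{pmatrix} \alpha^2(\gamma + r)^2 & 0 \\ 0 & \beta^2(\gamma + r)^2 \end{pmatrix}.$$

We notice that $Y$ changes the axial angle of the ellipsoid, while $X$ and $Z$ do not, see Figure 1.

[Figure 1]

Consequently, $X, Z \in T_\rho\mathcal{N}_0^2(0)$ and $K(X, Z) = 0$ by Corollary 2.5. By contrast, for the plane
spanned by $X$ and $Y$, in which the axial angle of the ellipsoid changes, the sectional curvature
does not vanish, by (8).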



4.2. Correspondence to other results

First, we consider the correspondence to the result of Otto [13]. He obtained an explicit
expression of sectional curvatures of $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ formally. By making this method rigorous, we
give an explicit expression of sectional curvatures of $\mathcal{N}^d$ in [15]. He introduced a manifold $\mathcal{M}$
which consists of all diffeomorphisms of $\mathbb{R}^d$ and an isometric submersion from $\mathcal{M}$ into $\mathcal{P}_2^{ac}(\mathbb{R}^d)$
(he did not specify the differential structure of $\mathcal{M}$ rigorously). He defined a metric $g^*$ on $\mathcal{M}$ which
carries the geometry of the $L^2$-space. Therefore, $(\mathcal{M}, g^*)$ is flat. Using O'Neill's formula [12],
he showed that the sectional curvature of $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ is given by

$$K(\psi_1, \psi_2)\det\left(g_\rho(\psi_i, \psi_j)\right) = \frac{3}{4}\int_{\mathbb{R}^d} \rho|u|^2 \, dx \ge 0,$$

where $\rho \in \mathcal{P}_2^{ac}(\mathbb{R}^d)$ and $\psi_1, \psi_2, \psi$ are tangent vectors at $\rho$ related by

$$u = \nabla\psi - [\nabla\psi_1, \nabla\psi_2] \quad \text{and} \quad \mathrm{div}\left(\rho(\nabla\psi - [\nabla\psi_1, \nabla\psi_2])\right) = 0.$$

This guarantees that $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ is a space of non-negative curvature. In addition, $K(\psi_1, \psi_2) = 0$
if and only if $\mathrm{Hess}\,\psi_1$ and $\mathrm{Hess}\,\psi_2$ pointwise commute. For $X$, $Y$ and $Z$ in (4.2), let $\mathcal{X}$, $\mathcal{Y}$ and $\mathcal{Z}$
be functions whose gradients are $X$, $Y$ and $Z$, respectively. Since the Hessian of $\mathcal{Z}$ pointwise
commutes with the Hessian of any function, $K(X, Z) = K(Y, Z) = 0$ follows. In the case of $X, Y$, we
demonstrate that Theorem 1.1 coincides with Otto's result.

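Let $\rho_0$ be the standard Gaussian measure on $\mathbb{R}^2$, that is $\rho_0 = (1, 1; \theta)$. Moreover, we define
a Gaussian measure $\rho$ and a diffeomorphism $\Psi$ respectively as follows:

$$\rho = (\alpha, \beta; \theta), \quad \Psi(x) = R\begin{pmatrix} \alpha & 0 \\ 0 & \beta \end{pmatrix}x,$$

where $\gamma = (\alpha^2 + \beta^2)^{1/2}$ and $R = R(\theta)$. Then the submersion sends $\Psi$ to $\rho$. We choose tangent
vectors $\psi_1, \psi_2$ at $\rho$ as $\mathcal{X}$ and $\mathcal{Y}$, corresponding to $X$ and $Y$ respectively. In terms of Otto's
result, we conclude that

$$g_\rho(\psi_i, \psi_j) = \delta_{ij} \quad (i, j = 1, 2),$$

$$[\nabla\psi_1, \nabla\psi_2](x) = \frac{2}{\gamma^2}R\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}{}^tR\,x, \quad u(x) = \frac{4}{\gamma^4}R\begin{pmatrix} 0 & \alpha^2 \\ -\beta^2 & 0 \end{pmatrix}{}^tR\,x.$$

Finally, we obtain

$$K(\psi_1, \psi_2) = \frac{3}{4\det\left(g_\rho(\psi_i, \psi_j)\right)}\int_{\mathbb{R}^2} \rho|u|^2 \, dx = \frac{12\alpha^2\beta^2}{(\alpha^2 + \beta^2)^3}.$$

Thus we confirm the equivalence between Theorem 1.1 and Otto's result. In [15], the sectional
curvature of $\mathcal{N}^d$ was also obtained using Riemannian submersion.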



4.3. $\mathcal{N}_0^d$ as Alexandrov spaces.

Next, we consider the correspondence to results when we regard $\mathcal{P}_2^{ac}(\mathbb{R}^d)$ and $\mathcal{N}^d$ as
Alexandrov spaces. Details can be found in [15].

It is well-known that the $L^2$-Wasserstein space over an Alexandrov space of non-negative
curvature is also an Alexandrov space of non-negative curvature (see [14, Proposition 2.10]).
Therefore $(\mathcal{P}_2(\mathbb{R}^d), W_2)$ is an Alexandrov space of non-negative curvature. Since $\mathcal{N}_0^d \subset \mathcal{P}_2(\mathbb{R}^d)$
is a geodesically convex subset, $(\mathcal{N}_0^d, W_2)$ is also an Alexandrov space of non-negative curvature.
(But it is not complete; the completion of $\mathcal{N}^d$ is given in [15].)

Lott and Villani made Otto's results rigorous by looking at the space of probability
measures as an Alexandrov space. They treated the space of probability measures $\mathcal{P}_2(M)$
over a smooth compact connected manifold $M$, and proved that $M$ has non-negative sectional
curvature if and only if $\mathcal{P}_2(M)$ has non-negative Alexandrov curvature ([8, Theorem A.2]).
They moreover defined the angle between geodesics in $\mathcal{P}_2^{ac}(M)$ ([8, Theorem A.17]). We
demonstrate it in the case of $M = \mathbb{R}^d$, although $\mathbb{R}^d$ is not a compact manifold. Fix $\rho \in \mathcal{P}_2^{ac}(\mathbb{R}^d)$.
If $\phi$ is a function on $\mathbb{R}^d$ such that $\phi + |\cdot|^2/2$ is convex, then it is regarded as a tangent vector of
$\mathcal{P}_2^{ac}(\mathbb{R}^d)$ at $\rho$. Let $\phi$ and $\psi$ be such functions. If we set

$$\mu(t) = [\mathrm{id} + t\nabla\phi]_\sharp\rho, \quad \nu(t) = [\mathrm{id} + t\nabla\psi]_\sharp\rho$$

for $t \in [0, 1]$, then $\mu(t)$ and $\nu(t)$ are geodesics starting at $\rho$. The angle between $\mu$ and $\nu$ is
given by

$$\cos\angle(\mu, \nu) = \frac{\int_{\mathbb{R}^d}\langle\nabla\phi(x), \nabla\psi(x)\rangle \, d\rho(x)}{\sqrt{\int_{\mathbb{R}^d}|\nabla\phi(x)|^2 \, d\rho(x)\int_{\mathbb{R}^d}|\nabla\psi(x)|^2 \, d\rho(x)}}.$$

Corresponding to this, we can measure the angle between $\mathcal{N}_0^2(\theta)$ and $\mathcal{N}_0^2(\varphi)$.

Proposition 4.2. For $\theta$ and $\varphi \in (-\pi/4, \pi/4]$, the angle between $\mathcal{N}_0^2(\theta)$ and $\mathcal{N}_0^2(\varphi)$ is
$2|\theta - \varphi|$.



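Proof. Let $\theta$ and $\varphi$ be as above. For any $\alpha, \beta, \lambda > 0$,

$$W_2((\alpha, \beta; \theta), (\lambda, \lambda; 0))^2 = 2\left(\lambda - \frac{\alpha + \beta}{2}\right)^2 + \frac{1}{2}(\alpha - \beta)^2.$$

Hence, the distance from $(\alpha, \beta; \theta)$ to $\Lambda$ is $|\alpha - \beta|/\sqrt{2}$ and the image of the nearest point
projection is

$$\rho = \left(\frac{\alpha + \beta}{2}, \frac{\alpha + \beta}{2}; 0\right).$$

Let $X(\theta)$ and $X(\varphi)$ be the symmetric matrices given by

$$X(\theta) = \frac{\alpha - \beta}{\alpha + \beta}R(\theta)\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}{}^tR(\theta), \quad X(\varphi) = \frac{\alpha - \beta}{\alpha + \beta}R(\varphi)\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}{}^tR(\varphi);$$

then we get

$$\exp_\rho X(\theta) = (\alpha, \beta; \theta), \quad \exp_\rho X(\varphi) = (\alpha, \beta; \varphi).$$

Since

$$g_\rho(X(\theta), X(\varphi)) = \frac{(\alpha + \beta)^2}{4}\mathrm{tr}\,X(\theta)X(\varphi) = \frac{1}{2}(\alpha - \beta)^2\cos 2(\theta - \varphi)$$

and

$$g_\rho(X(\theta), X(\theta)) = g_\rho(X(\varphi), X(\varphi)) = \frac{1}{2}(\alpha - \beta)^2,$$

we have

$$\frac{g_\rho(X(\theta), X(\varphi))}{\sqrt{g_\rho(X(\theta), X(\theta))\,g_\rho(X(\varphi), X(\varphi))}} = \cos 2(\theta - \varphi). \tag{4.3}$$

Therefore we conclude

$$\angle\left(\mathcal{N}_0^2(\theta), \mathcal{N}_0^2(\varphi)\right) = \mathrm{Arccos}\,\frac{g_\rho(X(\theta), X(\varphi))}{\sqrt{g_\rho(X(\theta), X(\theta))\,g_\rho(X(\varphi), X(\varphi))}} = 2|\theta - \varphi|.$$

□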
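
The computation in this proof can be replayed numerically. In the sketch below, the tangent matrix $X(\theta) = \frac{\alpha - \beta}{\alpha + \beta}R(\theta)\,\mathrm{diag}(1, -1)\,{}^tR(\theta)$ is an assumption: it is the symmetric matrix determined by the requirement $\exp_\rho X(\theta) = (\alpha, \beta; \theta)$ at the base point $\rho = ((\alpha + \beta)/2, (\alpha + \beta)/2; 0)$, and the code verifies that requirement before computing the angle from the metric $\mathrm{tr}\,XVY$. Function names and the numerical values are hypothetical.

```python
import numpy as np

def rotation(theta):
    """Rotation matrix R(theta)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def tangent_to(alpha, beta, theta):
    """Assumed tangent vector X(theta) at rho pointing towards (alpha, beta; theta)."""
    R = rotation(theta)
    return (alpha - beta) / (alpha + beta) * (R @ np.diag([1.0, -1.0]) @ R.T)

def angle(alpha, beta, theta, phi):
    """Angle between X(theta) and X(phi) with respect to g_rho(X, Y) = tr(X V Y)."""
    V = (0.5 * (alpha + beta)) ** 2 * np.eye(2)      # covariance of the base point rho
    g = lambda X, Y: np.trace(X @ V @ Y)
    Xt, Xp = tangent_to(alpha, beta, theta), tangent_to(alpha, beta, phi)
    cosine = g(Xt, Xp) / np.sqrt(g(Xt, Xt) * g(Xp, Xp))
    return float(np.arccos(np.clip(cosine, -1.0, 1.0)))

alpha, beta, theta, phi = 3.0, 1.0, 0.3, -0.2        # arbitrary values with theta, phi in (-pi/4, pi/4]

# exp_rho X(theta) has covariance (E + X) V (E + X); it should equal R diag(alpha^2, beta^2) R^T.
V_rho = (0.5 * (alpha + beta)) ** 2 * np.eye(2)
M = np.eye(2) + tangent_to(alpha, beta, theta)
target = rotation(theta) @ np.diag([alpha ** 2, beta ** 2]) @ rotation(theta).T
print(np.allclose(M @ V_rho @ M, target))

print(angle(alpha, beta, theta, phi), 2.0 * abs(theta - phi))
```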






References 

1. S. Amari, Differential-geometrical methods in statistics, Lecture Notes in Statistics, vol. 28,
Springer-Verlag, New York, 1985.

2. Y. Brenier, Polar factorization and monotone rearrangement of vector-valued functions, Comm. Pure Appl.
Math. 44 (1991), no. 4, 375-417.

3. J. A. Carrillo, R. J. McCann, and C. Villani, Contractions in the 2-Wasserstein length space and
thermalization of granular media, Arch. Ration. Mech. Anal. 179 (2006), no. 2, 217-263.

4. D. C. Dowson and B. V. Landau, The Frechet distance between multivariate normal distributions, J.
Multivariate Anal. 12 (1982), no. 3, 450-455.

5. S. Gallot, D. Hulin, and J. Lafontaine, Riemannian geometry, third ed., Universitext, Springer-Verlag,
Berlin, 2004.

6. W. Gangbo and R. J. McCann, The geometry of optimal transportation, Acta Math. 177 (1996), no. 2,
113-161.

7. C. R. Givens and R. M. Shortt, A class of Wasserstein metrics for probability distributions, Michigan Math.
J. 31 (1984), no. 2, 231-240.

8. J. Lott and C. Villani, Ricci curvature for metric-measure spaces via optimal transport,
Annals of Mathematics (to appear).

9. R. J. McCann, A convexity principle for interacting gases, Adv. Math. 128 (1997), no. 1, 153-179.

10. M. Knott and C. S. Smith, On the optimal mapping of distributions, J. Optim. Theory Appl. 43 (1984),
no. 1, 39-49.

11. I. Olkin and F. Pukelsheim, The distance between two random vectors with given dispersion matrices,
Linear Algebra Appl. 48 (1982), 257-263.

12. B. O'Neill, The fundamental equations of a submersion, Michigan Math. J. 13 (1966), 459-469.

13. F. Otto, The geometry of dissipative evolution equations: the porous medium equation, Comm. Partial
Differential Equations 26 (2001), no. 1-2, 101-174.

14. K.-T. Sturm, On the geometry of metric measure spaces. I, Acta Math. 196 (2006), no. 1, 65-131.

15. A. Takatsu, Wasserstein geometry of Gaussian measures, in preparation.

16. C. Villani, Optimal transport, old and new, Grundlehren der mathematischen Wissenschaften, vol. 338,
Springer, Berlin, 2008.

Asuka Takatsu 

Mathematical Institute, Tohoku University 
Sendai 980-8578, Japan 

sa6ni21@math.tohoku.ac.jp



